Model Quantization, Inference Optimization, GGUF Format, Privacy-preserving AI

Do LLMs Signal When They're Right? Evidence from Neuron Agreement
arxiv.org·23h
🧠Neuromorphic Hardware
Toward provably private insights into AI use
research.google·1d
🏠Self-hosted AI
Quantum-Resistant Federated Learning with Lattice-Based Homomorphic Encryption for Edge AI Systems
dev.to·1d
🏠Self-hosted AI
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.com·17h
📱Edge AI
From Lossy to Lossless Reasoning
manidoraisamy.com·9h
📱Edge AI
MIT’s Survey On Accelerators and Processors for Inference, With Peak Performance And Power Comparisons
semiengineering.com·10h
🏗️AI Infrastructure
StreetMath: Study of LLMs' Approximation Behaviors
arxiv.org·23h
🏠Self-hosted AI
Opportunistically Parallel Lambda Calculus
dl.acm.org·1d
🔍Query Compilers
Your Transformer is Secretly an EOT Solver
elonlit.com·22h
⏱️Time-series Optimization
Beyond the Hype: The Hidden Economics of AI Inference
dev.to·6h
🏗️AI Infrastructure
Building a Rules Engine from First Principles
towardsdatascience.com·1d
λFunctional Programming
Thought Engineering
pranavc28.github.io·23h
🏠Self-hosted AI
Writing an LLM from scratch, part 25 – instruction fine-tuning
gilesthomas.com·2d
🏗️AI Infrastructure
Quantum-Resistant Federated Learning with Lattice-Based Homomorphic Encryption for Edge AI Systems
dev.to·17h
🤝Federated Learning
Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error
arxiv.org·23h
🏠Self-hosted AI
zFLoRA: Zero-Latency Fused Low-Rank Adapters
arxiv.org·23h
🏗️AI Infrastructure
Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
arxiv.org·23h
🏗️AI Infrastructure
Model Inversion with Layer-Specific Modeling and Alignment for Data-Free Continual Learning
arxiv.org·23h
📱Edge AI
Holographic theory of LLMs: explains unbreakable bias and cross-model infection
habr.com·2d
🏠Self-hosted AI